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Abstract 

We present a new method for estimating the frontier of a sample. 
The estimator is based on a local polynomial regression on the power- 
transformed data. We assume that the exponent of the transformation 
goes to infinity while the bandwidth goes to zero. We give conditions on 
these two parameters to obtain almost complete convergence. The asymp- 
totic conditional bias and variance of the estimator are provided and its 
good performance is illustrated on some finite sample situations. 
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1 Introduction 

Let (Xi,Yi), i = be independent and identically distributed continuous 

variables and suppose that their common density has a support defined by 

S^{{x,y)eRxR;0<y<g{x)}. 

The unknown function g is called the frontier. We address the problem of 
estimating g. In [13] . we introduced a new kind of estimator based upon kernel 
regression on high power-transformed data. More precisely the estimator of g{x) 
was defined by 

/ n In \ 1/P 

f (p -I- 1 ) ^ - ^) / E (^^ - ^ ) 1 

where p = p„ — > oo and h = hn ^ are non random sequences, if is a 
symmetrical probability density with support included in [—1.1], and Kh{') = 
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K {•/h) /h. Although the correcting term (p + 1)^^^ was specially designed to 
deal with the case of a uniform conditional distribution of Y/X = x, this esti- 
mate has been shown to converge in any case. In the special but interesting case 
of a uniform conditional distribution of Y/X = x for a a— lipschitzian frontier 
the minimax rate of convergence is attained. We also proved that the estimator 
is asymptotically Gaussian. It is also interesting to note that, compared to the 
extreme value based estimators [71 |H1 UHl [HI HH [H], projection estimators [TH] 
or piecewise polynomial estimators [211 HOI [H] , this estimator does not require 
a partition of the support S. 

A natural idea suggested by our referees was to investigate the possible 
gains obtained by substituting a local polynomial regression to the Nadaraya- 
Watson regression. The basic idea in this theory consists in approximating 
locally a Ck+i regression function by a polynomial of degree k and taking the 
zero-degree term as an estimate of the regression. The regularity of the function 
brings improvement on the bias term. Accordingly, when dealing with high 
power-transformed data we establish in this paper that the bias of the local 
polynomial estimator of degree k is Op{h{hp)^) and the variance is Op {1/nhp). 

Let us introduce the notations Z = {p + 1) Y^ and r„ (x) = E {Z/X ~ x). 
The conditional distribution of Y/X = x is supposed to be uniform on [0, g (x)], 
so that rn (x) = gP (x) . For fixed p the method for estimating r„ {x) first consists 
in solving the following minimization problem 

n / k \ ^ 

arg mmJ2i{p+l)YP-J2 ft iX^ ~ xf \ Kh{X,^x). (1) 

Then, denoting hy j3 — (/3o, the solution of this least square minimiza- 

tion, one considers /3o as an estimate of r„ [x) = E [Z/X ~ x). The originality 
and the difficulty of our paper in contrast with these traditional lines is that 
here p — pn ^ oo and that we consider 13q^^ as an estimate of g (x) . So we write 
dn {x) — PI^^ — rl/^ (x). We refer to [TBI 1151 US] for other definitions of local 
polynomials estimators (i.e. without high power transform) and to [3l |6l [9l [1] [2] 
for the estimation of frontier functions under monotonicity assumptions. 

In order to get simplified matricial expressions, let us denote by X the n x 
(fe -|- 1) matrix defined by the lines [l,Xi — x, {Xi — a;)*']i=i^...„. The diagonal 
matrix of weights diag{Kh [Xi — x)} is denoted by W. We call design the 
vector X — {Xi, X„)* and we denote by Z the vector (Zi, Z„)*. Then the 
local regression problem ([T|) can be rewritten as 

P = arg mm (Z-X/3)* W (Z-X/3) , 

where /? = (/3o, • . • , /3a:)*. It is well known from the weighted least square theory 
that 

/3 = (X*WX)"^X*WZ. 
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In particular, in the case k — we have 



f3 = l3o^J2 ^-^h {X, -x) {X, - x) , 

1=1 I i=l 

SO we exactly find back the estimator 5„ (a;) = I3q^^ studied in |13| . In order 
to give a general expression of f„ (a;), we adopt the notations of Fan and Gi- 
jbels whose book will also serve of reference for some preliminary results 
established in Section [2] (see also [22 for a general multidimensional analysis). 
Basing on this, the asymptotic conditional bias and variance of the estimator 
are derived in Section [3] when Y given X = x is uniformly distributed. This 
result is extended in Section |4l where the almost complete convergence is proved 
without this uniformity assumption. We conclude this paper by an illustration 
of the behavior of our estimator on some finite sample situations in Section [51 
Technical lemmas are postponed to the appendix. 



2 Preliminary results 

Let X e K. From now on, it is assumed that the density function / of Xi 
is continuous at x and that f(x) > 0. Besides, we suppose that there exists 
ffmin > such that, for all t e R, gmin < git)- Let S„ = X*WX be the 
(fc + 1) X (fc + 1) matrix [5„j+;] o<j,i<fe defined by 

Sn,j =J2i^^- Kh {X, - X) . 
i=l 

Similarly, denoting by S the nxn diagonal matrix diag {Kf^ {Xi — x) g'^^ (-^i)}, 
S; = X*SX is the (fc + 1) X (fc + 1) matrix [S^^j+i]Q<j.i<k with 

n 

S*n, = E - {X, - x) g^^ (X,) . 

i=l 

Finally, we introduce the matrices S = [/^j+i]o<j i<k ^^'^ ^* = [^j+i\Qi<j i<k with 
(Xj = J u^K [u) du and Vj = / (u) du. Following roughly the same lines as 
Fan and Gijbels [5j, we obtain asymptotic expressions for Sn.j and S*^ .j. The 
first equality is a standard result of the theory and the second one ([3]) boils 
down to an easy adaptation. Proofs are thus omitted. 

Proposition 1 // /i — > and nh oo, then 

Sn^j^nh^f{x)fi,[l + Op{l)]. (2) 

//, moreover, ph ^ we have for any Ci function g 

5;^. = nh^-'g^P (x) f (x) V, [1 + Op (1)] . (3) 
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Let us now quote a general expression of the conditional bias of Vn (x). From 
Fan and Gijbels |5], and denoting by ei = (1,0, ...,0)* the first vector of the 
canonical basis of M'^"'"^, we have 

so that 

E (r„ (x) IX) - r„ (x) = e* S-^X'W [E {Z j X) - X/3] . (4) 
In Appendix I we give a detailed proof of the following 

Proposition 2 Suppose g is a Ck+i function. If h ^ 0, nh — > oo and ph — >■ 0, 
then ^ ^ 

'vn {x) 



E 



. fn (x) 

We now examine the conditional variance of f„ (a;) 

V (f„ (x) /X) = e\S-^X'Y{WZ/X) XS^^ei. 

Taking into account of the independence of the pairs {Xi, Yi) , Y(WZ/X) is the 
diagonal matrix diag {K^ {Xi — x) V{Zi/Xi = a;)}. From the uniformity of the 
conditional distribution of the Yi/Xi = x, it is easily seen that V{Zi/Xi — x) = 

i^g'^^ {x), so that 

T? 



V (f„ (x) IX) = -J_e* S-is:S-iei. 



Following the same lines as Fan and Gijbels [5], we obtain the following asymp- 
totic expression 

Proposition 3 Suppose g is a Ck+i function. // — > 0, nh — > cxd and ph — > 0, 
then 

'rn{x),^\_ C 1 p2 



where C = e\S S*S ei. 

The proof of Proposition [3] is much easier than the one of Proposition [2] and it 
thus omitted. 



3 Conditional bias and variance of 



Here we present the main results of this paper and an outline of their proofs. 
Many details and ancillary results are postponed to Appendix II. Proofs are 
made under the assumption that g is a Ck+i function and the system of condi- 
tions below 

n — > oo, — > 0, p — > oo 
H : nh ^ oo, hp ^ 



{p/nh) log (nh) ^ (hp) 



2k+2 
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Theorem 1 Suppose H holds and g is a Ck+i function. Then, the asymptotic 
conditional bias of the estimate is given by 



E 



six) 



1/x) = Op {h {hpf) 



Proof. Let us write w„ {x) = r„ {x) /r„ (a;) — 1, so that g„ ix) / g{x) = {1 + Wn (x))^^^ 
and define 



(5) 



Let a„ = (p/nh)^^^ . For sufficiently large n we have a„ < 1/2, and thus, 
Lemma [9] entails ^ 

A„l {|w„ < a„} < C6-w^ (x) , (6) 

P 

which leads to the following bound 

E (A„l {\w^ {x)\ < «„} /X) < C6^E (x)| /X) < Cg— E^^ (^2 

p p 



Now, from Proposition [5] and Proposition [31 



rn (x) , \ „9 / r-n (x) 



Tn (x)' J ' V (x 



1/X 



-[l + o,(l)]+0,((M''+') 



/ (x) nh 2p + 

Then, taking into account of h {hp)^ y/nhp = ■\/log(n/i) — )■ oo, it follows that 



E(A„l{kn {x)\<a^}/X)<ce^ \ojP-)+Op({hpf''+' 

p L \nn/ \ 

= Op (^a„/ y/nhp^ + Op (^anh {hp)^^ 
= Op (a„h {hp)^^ . 

Besides, making use of Lemma [71 we can write 

E(A„l{|u;„ {x)\ > an} IX) < Cg {X)P{\wn {x)\ > an/X}, 

and, from the triangular inequality, 

P{\wn{x)\>a,jX}<¥{2\wn{x)-E{wn (x) /X)\>an/X} 
+ P{2\E{wn{x) /X)\>an/X}. 



1/2 



(7) 



Recalling that 



E{wn (x) /X)^E[ ^^^^ -1/X]= Op {{hpf+' 



and noticing that {hpf'^^ /«„ = [p/nhf'^^ (log(n/i))^^^ — >• 0, we conclude that 
the sequence P{2 |E('u;„ (a;) / X)\ > Un/X} goes to 0. Moreover, remark that 
P{2 |E {wn {x) /X)\ > ttn/X} is a {0, l}-valued random variable. This means 
that for a sufficient large n depending on we merely have 

¥{2\E{wn {x)/X)\ >an/X}^0. 

Now, from Lemma [51 

P (2 \Wn (X) - E {Wn (x) / X)\ > Oi^j X) 

{x) - E (f„ {x) /X)\ > ia„r„ (x) /X 
<2exp|-C4y^ [1 + 0^(1)] 
= 2exp l^-^^Jnh/p\og^{nh) [1 + Op (1)] \og{nh) 

where oOp (1) stands for a sequence going almost surely to the infinity. We thus 
have at least 

E{Anl{\wnix)\>an}/X)^Op{l/nh). (8) 
Collecting ^ and dH]) yields 

E (A„/A') = Op (a„/i (hp)^^ + Op (l/nh) . 

From 

<E(A„/A') 



E 



E ( - 1/x) - -E {wn (x) IX) 



9{x) J p 

and Proposition [2l we obtain 

'dn (x) 
9 



- l/X) = -E {wn (x) IX) + Op (oLnh {hpf \ + Op iXlnh) 

[x) J p V y 

^Op(h{hp)''^+Op{llnh). (9) 

Finally, since h{hp)^nh = {nh I p) (hp)'^^^ = yJnhJp\og{nh) — ^ oo, expan- 
sion dH]) reduces to 



E 



9n {x) 

9{x) 

and the conclusion follows. ■ 



II x\ = Op {h (hp)') 



Theorem 2 Suppose H holds and g is a Ck+i function. Then, the asymptotic 
conditional variance of the estimate is given by 

9\x) J \nhp 
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Proof. Introducing 

= —-T- - 1 , 

9[x) P 

we have ^ 

\ g[x) ) 

The first term is bounded using Proposition [S] 



Second, 

V {5/X) < E (Syx) - E (Al/X) , 
and yields, for sufficiently large n, 

All {\wn ix)\ < an} < cl^wt, (x) < cl^wl (x) , 

which entails 

E (A^^l {\wn {x)\ < an} IX) < cl^E {wl (x) /X) 



p2 



o,(iL)+o,(,w»« 



In a similar way as in the previous proof, one has 

E{All{\Wn{x)\ > an}/X) < C5 {X)F{\wn{x)\ > an/X} 



It follows that 

E {AllX) = O, + O, (aie {hpf^) + O, 

and, taking account of = (p/nh)^^'^ and nh/p\o^{nh) — s- oo, we finally 
obtain 



V 



^ \nhp 



and the result is proved. 
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Remark 1 Under the assumptions of the above theorems, the conditional mean 
square error is given by 



\9{x) J \9{x) J 

Under condition H, the ratio between the bias and variance terms is asymptot- 
icaUy equivalent to \og^ (nh). Thus, bias and variance of ^„ (x) are approxima- 
tively of same order, up to this logarithmic factor. 



9n jx) 
9{x) 



4 Convergence of ^„ {x) under general conditions 

In this section, the almost complete convergence of 5„ (x) is established without 
any assumption on the conditional distribution of Y given X. 

Theorem 3 If h ^ 0, p ^ oo, and nh/logn — )■ oo, then (x) converges to 
g (x) almost completely. 



Proof. Introducing 



X,, - X 



and s'n (x) = X]r=i ^ i-^i) Lemma [2] entails that f„ can be rewritten as 

f„ (x) = s„ (x) + s„ (x) Op (!) . 
Thus, with 2r] = e/g{x) and since [1 + Op(l)]^/P = [1 + Op(l)], we have 



{\9n{x)- g{x)\ >e} = 



(x) 



C 



9{x) 
s'J" (x) 



9ix) 



- 1 



>2ry| 



tj" (x) 



with 



fJ" jx) 
9{x) 



^a(XO(p+l) 



i=l 





p' 


.gix). 





r \ "^P 

1/p 



Opil) 



Since (1 +p)^^^ 1, let us focus on 



Tn (x) 



.9ix)_ 



8 



Taking < 5 < i], \Xi — x\ < h implies Yi — g (x) {1 + 6) < and thus 

i/p 



Y, 



l{Y,<g{x){l + 5)} 



9{x) 

n 

Y,a{X,)l{Y,<g{x){l + S)} 



<{l + 5) 

Moreover, since, for n large enough, j > 2, it follows that 
{r„ (x) > 1 + r/} 

C a (XO l{Y,<g (x) (1 + S)} > 2^ 

'-j:kux.-x)y.u/''^-'"' 



l{Y,<gix){l + S)} 



> 2 



Now, the only difference with the proof of Theorem 1 in [T3] is that the positive 
kernel K (x) is replaced by the signed kernel of higher order K (x) X^^^i 
The case {T„ (x) < 1 — t;} is easily treated in a similar way. ■ 



j=0 ^3-' 



5 Numerical experiments 

Here, the following model is simulated: X is uniformly distributed on [0, 1] and 
Y given X = x is distributed on [0,(7(0;)] such that 



(10) 



p(y >y\X ^x)^ [1- 



with 7 > 0. This conditional survival distribution function belongs to the 
WeibuU domain of attraction, with extreme value index —7, see 3] for a review 
on this topic. In the following, three exponents are used 7 G {1, 2, 3}. The case 
7 = 1 corresponds to the situation where Y given X = a; is uniformly distributed 
on [Q,g{x)]. The larger 7 is, the smaller the probability ([T0|) is, when y is close 
to the frontier g{x). The frontier function is given by 

5(2;) = (1/10 + sin(7ra;)) (ll/lO - exp (-64(x - 1/2)^) /2) . 

The following kernel is chosen 

K{t) = cos2(7rt/2)l{i e [-1, 1]}, 

and we limit ourselves to first order local polynomials, i.e. k = 1. In this 
case, to fulfill assumption H, one can choose h = c;in~^/^(log n)^+'^'^/^ and 
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p = Cp'n}^'^{\ogn)~^~'^ where r, cu and Cp are positive constants. In practice, 
since the choice of Ch and Cp is more important than the logarithmic factors, 
we use h = ^a{X)n-^l'^ and p = n^/"^. The multiplicative constants are chosen 
heuristically. The dependence with respect to the standard-deviation of X is 
inspired from the density estimation case. The scale factor 4 was chosen on the 
basis of intensive simulations, similarly to |13) . 
The experiment involves four steps: 

• First, m = 500 replications of a 500— sample are simulated. 

• For each of the m previous set of points, the frontier estimator ^„ is 
computed for k = 1. 

• The m associated Li distances to g are evaluated on a grid. 

• The smallest and largest Li errors are recorded. 

Results are depicted on Figure [THSl where the best situation (i.e. the estima- 
tion corresponding to the smallest Li error) and the worst situation (i.e. the 
estimation corresponding to the largest Li error) are represented. Worst situ- 
ations are obtained when no points were simulated at the upper boundary of 
the support. To overcome this problem, the normalizing constant {p + 1) in ([T]) 
could be modified as in |13| . Section 6 to deal with some particular parametric 
models of Y given X = x. 



Appendix I: Conditional bias of (x) 

In this appendix, we provide a proof of Proposition [21 From we have 
'rn (x) 



l/X]^ g-^ (x) elS-^X'W [E (Z/X) - X/3] , 
where the term E (Z/X) — X/3 can be rewritten as 

k k 

i=o j=o 
Taylor-Lagrange formula with Pj = jf (x) and < 6* < 1 yields 



g^iu) = Y^P,{u~xy + {u~xr'—---^ix + eiu-x)) 



k + l\ dx^+^ 



so that, we can derive, for < 9i < 1 depending on X^, the following expansion 

fc+ 1! 



(X,) - ^ /3, (X, - xY = (X, - xf+' T^i^ + 0^ {X, - x)) 



3=0 
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Since K has a bounded support, we have Kh {Xt — x) — for \Xi — x\ > h. If 
\Xi ~ x\ < h and Q < Oi < 1, under the conditions /i — > and ph ^ 0, Lemma [3] 
yields 

{X, - xY Ku {X, - x) {x + {X, ~ x)) 



{X, - xy Kh {X, - x) 



k+l 



Thus, recalhng that Snj = J2"=i i^i - {Xi - x) and = jr^- (x), 

the (fc + l)-dimensional vector X*W (E (Z/X) — X/3) can be rewritten as 



,i=l 



fc+1 



/3fe+l'S'n,fc+j + 



fc+1! 



n,fc-|-j + 



j"=i,...,fc+i 



i=i,...,fc+i 



Introducing the vector c„ = {Sn,k+\, Sn,2k+iY , we obtain 



fc+1 



i=i 



X*W (E (Z/A-) - X/3) = /3,+ic„ + ■^c„ ^ p^'/-^' (x) o (1) , 
and, returning to the bias of r„ (x) 
E 



fc+1 



(11) 

Recalhng that S„ = n/ (.x) HSH [1 + Op (1)] with H = diag{l,h, . . . ,h^), we 
have 



1 



H-^S-^H-^[l + Op(l)] 



Besides, introducing the vector c = (/^fc+i, ...,/i2fe+i), the asymptotic expression 
of Sn,j estabhshed in Proposition [T] entails 

c„ = n/i'=+ V (x) He [1 + Op (1)] . 

Let us first focus on the first term of the bias expansion (fTT|) : 

g-P (x) e\S-^l3k+iCn = g-^ (x) i3k+inh''+^ f (x) e\ll^S^Il^Ilc [1 + Op (1)] 

nf [X) 

= g-P {x) /i'=+Vfc+ie*iS-ic [1 + Op (1)] , 
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and using the expression of in dTSl) , we have 



k+i 



leading to 

Let us now consider the second term in 



(12) 



, fe+i 

— e*S-ic„^p^ff~^" (x)o(l) = e*S,;ic„p^-+io(l) 



S-iH-ic„/+Vp (1) 



Expanding H~^Cn we have 

n-^Cn = H-in/i'=+ V (x) He [1 + Op (1)] = nh^+^f [x) c[l + Op (1)] 
which entails 



1 



-{S-^YL'^Cnp'^+^Op (1) - e\S-^cOp ( (M 



n/ (a:) 



\k+l 



Op({hpf+'). (13) 



Collecting (fT2)) and (|13l) . we obtain the announced result 



E 



(a^) 



i^-i/^Uo,(,M'«). 



Appendix II: Auxiliary results 

We first quote a Bernstein-Frechet inequality adapted to our framework. 

Lemma 1 Let Xi,...,X„ independent centered random variables such that for 
each positive integers i and k, and for some positive constant C , we have 



Then, for every e > 0, we have 



(14) 



> e 



\ 



^EXf < 2exp. 



4 + 2eC/VElUW 



The proof is standard. Note that condition (|T4)) is verified under the bounded- 
ness assumption Vi > 1, \Xi\ < C. In the next lemma, an asymptotic expansion 
of the estimated regression function f„ (a;) — e*S^^X*WZ is introduced. 
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Lemma 2 The estimated regression function r„ (x) can be rewritten as 
where {uqUi, is t/ie first line of the matrix S 

Proof. It is known from the local polynomial fitting theory that r„ (x) = /3o = 
e\S^^x*WZ admits the following asymptotic expression 

where 

K*{t)^e[S-^l,t,...,t'')K{t) 

is the so-called equivalent kernel, see [5] . The remaining of the proof consists in 
explicitly writing this equivalent kernel. It is worth noticing that Op (1) depends 
exclusively of the design X. ■ 

The following lemma is dedicated to the control of the local variations of the 
derivatives of g^, when p — >■ oo, on a neighborhood of size h. 

Lemma 3 Suppose g is a Ck+i function with k < p. If, moreover, ph ^ and 
\u — v\ < h, then 



d'^+'^gP _ d'^+^gP 



fc+i 



Proof. From — pgP~^^ and a recurrence argument it is easily checked 
that 

where the 4'j are continuous functions. The triangular inequality entails 

\gP-' (u) cj,, {u) - gP-^ {v) <t>, {v)\ < \gP-' \cj,, (u) - cj,, {v)\ 

+ \^,{v)\\gP-^iu)-gP-^v)\, 

and, from Lemma |8l if p/i — > and |u — < h we get, for sufficiently large n, 

\gP-^ {u) (u) - gP-^ {v) 0, (^;)| < \gP-^ (u)| o(l) + c,D, \gP-^ {u)\ {p - j) h, 

where Dj = sup^g[„ |0j(s)|. Thus, 

gP-^ (v) 0, (v) = gP-^ (u) 0, (u) + gP-^ (u) [O {ph) + o(l)) 
= gP'^u)(j,,{u)+gP-^u)o{l), 
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and replacing in ([TS|) yields 

Qk+l p I 

= ^H+E(^.''^-^N''a)> 

and the result is proved. ■ 

Let us consider, for i = 1, . . . ,n the random variables defined by 
p^P (a:) 

The next two lemmas are preparing the application of the Bernstein-Frechet 
inequality given in Lemma [TJ First, it is established that the are bounded 
random variables. Second, a control of the conditional variance V (X]"=i ^i/^) 
is provided. 

Lemma 4 There exists a positive constant ci such that < C2 for all i = 
l,...,n. 

Proof. Since the kernel K is bounded and has bounded support, it is easily 
seen that a (Xj) = if \Xi — x\ > h and that a (Xi) — O (;^) uniformly in i. 
Noticing that < (Xi) and using Lemma [51 we get 

pgP [x) 

<^±^\a(X,)\\ip+l)gP{X,)\ (16) 
pgP {x) 

^ ^TT + 1) 9" ix) (1 + c,ph) O f 4 
pgP (x) \ nfi 

^ O {1) {1 + O {ph)) , 
and the result is proved. ■ 

Lemma 5 There exists a positive constant C3 such that 



nh 
P 



^e./A' Uc3[l + 0p(l)], (17) 



or equivalently, 

nh 



p I Mv(^E6/'^j -yyV^[l + o,(l)]. (18) 



14 



Proof. Recalling that 

V {{p + 1) Yt - gP (X,) /X) = V {ZJX) 
we can write 



2p+l ' 



{nhf 1 
52P [x) 2p + 

1 1 



g'^P [x) 2p + 


1 p (x) 


1 


1 


g'^P (x) 2p + 


1 p [x] 


1 


1 


g'^P (x) 2p + 


1 p {x) 



Y,Kl{X,-x) 



3=0 



Xi - X 



Xi - X 



jd=0 1=1 
k 



j+l 



3.1=0 

Now, substituting the asymptotic expression for S"* ^ into the above expression 
yields 



Ee./-^ 



nh 1 
2p+l/(x) 



E UjUlVj+l [1 + Op (1)] 



and the parts ^T7\ and HH]) of this lemma follow. ■ 

The next two lemmas are the key tools to prove Theorem [TJ Lemma [S] is mainly 
a consequence of the Bernstein- Frechet inequality given in Lemma [T] Lemma [7] 
is dedicated to the control of the random variable A„ introduced in ^ . 

Lemma 6 There exists a positive constant C4 such that for every e > 0, 

P(|f„ (x) -E(f„ {x) /X)\ > £r„ {x) IX) < 2exp |-C4ye' [1 + Op (1)]| , 

where the sequence [1 + Op (1)] depends exclusively on the design X . 

Proof. Following the asymptotic expression of f„ (x) in Lemma[21 we can write 

P (|f„ (x) - E (f„ (x) /X)\ > er„ (x) /X) 



J2aiX,){Z,-EiZjX)) 



[l + Op(l)] >er„ {x)/X 



J2'^iX^){{p+l)Yr-gPiX,)) 



>[l + 0p (1)] egP {x) IX 
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It is worth noticing that, conditionahy to X, the sequence Op (1) can be seen as a 
deterministic sequence converging to 0. We now introduce the bounded variables 
(see Lemma 2]). In accordance with the Bernstein- Frechet inequahty given in 
Lemma [1] and with the expressions (1171) and (llSp in Lemma [SI we write 

P (|f„ (x) - E (f„ (x) IX)\ > er„ (x) /X) 
+ Op (1)] e-/xj 
nh 



>e[l + Op (1)] 



£[l + Op(l)] 



nh 



< 2 exp < - 



pVv(i:r=iC./'^) 



4 + 2e [1 + Op (1)] 



nh 



c2/VV(Er=i6/A') 



2 exp < — 



[1 + 0,(1)] 



2 exp ■ 



4 + c2e[l + Op(l)]f /V(ELi6/A') 
£2|ic3[l + Op(l)] I 



4 + C2C3S [1 + Op (1)] 



f nh ^ ,^ , 
< 2exp<^ -C4 — £^ 1 + o„ 1 
I P 

and the conclusion fohows. ■ 



Lemma 7 T/ie random variable A„ is bounded conditionally to X , which means 
that there exists a positive constant, depending on the design, C5 (X) such that 
A„ < c5iX). 

Proof. From inequahty (|16l) . we have 
|f„(x)| = |e*S-iX*WZ| 



nf (x) 



X,,-xV 



[1 + Op (1)] 



< \^^\a{X,)\{p + l)gP{X,)^ [l + Op{l)] 

P 1 
= ci^^gP (x) [1 + Op (1)] -card{i : \Xi - x\ < h} . 

Then, the strong law of large numbers entails 

|r„(x)| <r„(x)ciJ[P(|X-a;| </i)] [l + Op(l)] 
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and from the continuity of the density /, we have 



l.F{\X-x\<h) = fix) + 



Consequently, 



< 2cip/ (x) [1 + Op (1)] 



with Op (1) depending on the design X. We thus write 



rn (x) 



- 1 



<C{X), 



(19) 



where C (X) is a positive constant under the conditioning hy X . As an imme- 
diate consequence, we get 



{l+Wn (x))^/P-l=Op(l) 



(20) 



From and (I^UI it is clear that A„ is bounded conditionally to X . ■ 
Finally, we quote two results from [13 (Lemma 5 and Lemma 4 respectively). 

Lemma 8 If ph 0, there exists a positive constant ci such that 

9" {x)<gP {y) + cig^ {y)ph 

for \x-y\< h. 

Lemma 9 There exists a constant cg such that \u\ < 1/2 entails 
(1 + uf^''' -l-u/p < ceu^/p. 
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(a) Best situation 




(b) Worst situation 



Figure 1: The frontier g (continuous line) and its estimation (dashed hne). The 
sample size is n = 500, X is uniformly distributed on [0, 1] and 7 = 1. 
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(a) Best situation 




(b) Worst situation 



Figure 2: The frontier g (continuous line) and its estimation (dashed hne). The 
sample size is n = 500, X is uniformly distributed on [0, 1] and 7 = 2. 
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(a) Best situation 




0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 



(b) Worst situation 

Figure 3: The frontier g (continuous line) and its estimation (dashed hne). The 
sample size is n = 500, X is uniformly distributed on [0, 1] and 7 = 3. 
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